
    Biased Competition in Visual Processing Hierarchies: A Learning Approach Using Multiple Cues

    In this contribution, we present a large-scale hierarchical system for object detection fusing bottom-up (signal-driven) processing results with top-down (model or task-driven) attentional modulation. Specifically, we focus on the question of how the autonomous learning of invariant models can be embedded into a performing system and how such models can be used to define object-specific attentional modulation signals. Our system implements bi-directional data flow in a processing hierarchy. The bottom-up data flow proceeds from a preprocessing level to the hypothesis level, where object hypotheses created by exhaustive object detection algorithms are represented in a roughly retinotopic way. A competitive selection mechanism is used to determine the most confident hypotheses, which are used on the system level to train multimodal models that link object identity to invariant hypothesis properties. The top-down data flow originates at the system level, where the trained multimodal models are used to obtain space- and feature-based attentional modulation signals, providing biases for the competitive selection process at the hypothesis level. This results in object-specific hypothesis facilitation/suppression in certain image regions, which we show to be applicable to different object detection mechanisms. In order to demonstrate the benefits of this approach, we apply the system to the detection of cars in a variety of challenging traffic videos. Evaluating our approach on a publicly available dataset containing approximately 3,500 annotated video images from more than one hour of driving, we can show strong increases in performance and generalization when compared to object detection in isolation. Furthermore, we compare our results to a late hypothesis rejection approach, showing that early coupling of top-down and bottom-up information is a favorable approach, especially when processing resources are constrained.
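
    The key idea of biasing the competition among hypotheses with top-down signals before selection (rather than rejecting hypotheses afterwards) can be sketched in a few lines of Python. The sketch below is purely illustrative; the function, the multiplicative weighting scheme, and all parameter values are assumptions rather than the authors' implementation.

        import numpy as np

        def select_hypotheses(confidences, td_bias, gamma=1.0, top_k=3):
            """Competitive selection over object hypotheses.

            confidences : (N,) bottom-up detector confidences
            td_bias     : (N,) top-down modulation; 1.0 = neutral, >1 facilitation, <1 suppression
            gamma       : coupling strength of the top-down bias (illustrative)
            """
            modulated = confidences * td_bias**gamma       # early coupling: bias before selection
            weights = np.exp(modulated - modulated.max())  # softmax-style competition
            weights /= weights.sum()
            order = np.argsort(weights)[::-1]
            return order[:top_k], weights

        # Five hypotheses; the top-down model facilitates the third and suppresses the fifth
        conf = np.array([0.4, 0.7, 0.5, 0.3, 0.6])
        bias = np.array([1.0, 1.0, 1.8, 1.0, 0.6])
        winners, w = select_hypotheses(conf, bias)
        print(winners, np.round(w, 3))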

    Mislocalization of Visual Stimuli: Independent Effects of Static and Dynamic Attention

    Shifts of visual attention cause systematic distortions of the perceived locations of visual objects around the focus of attention. In the attention repulsion effect, the perceived location of a visual target is shifted away from an attention-attracting cue when the cue is presented before the target. Recently, it has been found that, if the visual cue is presented after the target, the perceived location of the target shifts toward the location of the following cue. One unanswered question is whether a single mechanism underlies both attentional repulsion and attraction effects. We presented participants with two disks at diagonal locations as visual cues and two vertical lines as targets. Participants were asked to perform a forced-choice task to judge the targets' positions. The present study examined whether the magnitude of the repulsion effect and the attraction effect would differ (Experiment 1), whether the two effects would interact (Experiment 2), and whether the location or the dynamic shift of attentional focus would determine the distortion effects (Experiment 3). The results showed that the effect size of the attraction effect was slightly larger than that of the repulsion effect and that the preceding and following cues had independent influences on the perceived positions. The repulsion effect was caused by the location of attention and the attraction effect was due to the dynamic shift of attentional focus, suggesting that the underlying mechanisms for the retrospective attraction effect might be different from those for the repulsion effect.

    Recognizing recurrent neural networks (rRNN): Bayesian inference for recurrent neural networks

    Recurrent neural networks (RNNs) are widely used in computational neuroscience and machine learning applications. In an RNN, each neuron computes its output as a nonlinear function of its integrated input. While the importance of RNNs, especially as models of brain processing, is undisputed, it is also widely acknowledged that the computations in standard RNN models may be an over-simplification of what real neuronal networks compute. Here, we suggest that the RNN approach may be made both neurobiologically more plausible and computationally more powerful by its fusion with Bayesian inference techniques for nonlinear dynamical systems. In this scheme, we use an RNN as a generative model of dynamic input caused by the environment, e.g. of speech or kinematics. Given this generative RNN model, we derive Bayesian update equations that can decode its output. Critically, these updates define a 'recognizing RNN' (rRNN), in which neurons compute and exchange prediction and prediction error messages. The rRNN has several desirable features that a conventional RNN does not have, for example, fast decoding of dynamic stimuli and robustness to initial conditions and noise. Furthermore, it implements a predictive coding scheme for dynamic inputs. We suggest that the Bayesian inversion of recurrent neural networks may be useful both as a model of brain function and as a machine learning tool. We illustrate the use of the rRNN by an application to the online decoding (i.e. recognition) of human kinematics
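
    A minimal sketch of the recognition principle is given below, under strong simplifying assumptions: a tanh RNN serves as the generative model, the state is observed directly with noise, and a fixed scalar correction gain stands in for the paper's derived Bayesian update equations. The recognizer propagates its own prediction through the generative RNN, compares it with the incoming observation, and corrects its state estimate with the resulting prediction error, which already illustrates the robustness to a wrong initial condition.

        import numpy as np

        rng = np.random.default_rng(0)
        n, T = 8, 200
        W = 1.5 * rng.standard_normal((n, n)) / np.sqrt(n)   # generative RNN weights (rich dynamics)

        def f(x):                                             # each unit: nonlinear function of its input
            return np.tanh(W @ x)

        # Simulate dynamic input with the generative RNN; observe the state noisily
        x_true = np.zeros((T, n))
        x_true[0] = rng.standard_normal(n)
        for t in range(1, T):
            x_true[t] = f(x_true[t - 1])
        y = x_true + 0.05 * rng.standard_normal((T, n))

        # Recognition pass: predict with the generative model, correct with the prediction error
        k = 0.7                                               # fixed correction gain (illustrative)
        x_hat = np.zeros((T, n))                              # deliberately wrong initial condition
        x_pred_only = np.zeros((T, n))                        # pure prediction, no error correction
        for t in range(1, T):
            pred = f(x_hat[t - 1])                            # top-down prediction
            err = y[t] - pred                                 # prediction-error message
            x_hat[t] = pred + k * err                         # bottom-up correction
            x_pred_only[t] = f(x_pred_only[t - 1])

        print("error with prediction-error correction:", round(float(np.linalg.norm(x_hat[-1] - x_true[-1])), 3))
        print("error without correction:              ", round(float(np.linalg.norm(x_pred_only[-1] - x_true[-1])), 3))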

    Dynamic, Task-Related and Demand-Driven Scene Representation

    Humans selectively process and store details about their vicinity based on their knowledge of the scene, the world, and their current task. In doing so, only those pieces of information that are required for solving the given task are extracted from the visual scene. In this paper, we present a flexible system architecture along with a control mechanism that allows for a task-dependent representation of a visual scene. In contrast to existing approaches, our system is able to acquire information selectively according to the demands of the given task and based on the system’s knowledge. The proposed control mechanism decides which properties need to be extracted and how the independent processing modules should be combined, based on the knowledge stored in the system’s long-term memory. Additionally, it ensures that algorithmic dependencies between processing modules are resolved automatically, utilizing procedural knowledge that is also stored in the long-term memory. By evaluating a proof-of-concept implementation on a real-world table scene, we show that, while solving the given task, the amount of data processed and stored by the system is considerably lower than in processing regimes used in state-of-the-art systems. Furthermore, our system only acquires and stores the minimal set of information that is relevant for solving the given task.
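
    Resolving algorithmic dependencies between processing modules from stored procedural knowledge amounts to planning a minimal, topologically ordered execution schedule. The sketch below illustrates this with Python's standard library; the module names and dependency structure are invented for illustration and do not correspond to the system described above.

        from graphlib import TopologicalSorter

        # Hypothetical procedural knowledge: each module and the modules it depends on
        modules = {
            "segment_region":  [],
            "extract_color":   ["segment_region"],
            "extract_shape":   ["segment_region"],
            "classify_object": ["extract_color", "extract_shape"],
        }

        def plan(required_modules):
            """Return only the modules needed for the task, in executable order."""
            needed, stack = set(), list(required_modules)
            while stack:                      # pull in transitive dependencies
                mod = stack.pop()
                if mod not in needed:
                    needed.add(mod)
                    stack.extend(modules[mod])
            ts = TopologicalSorter({m: [d for d in modules[m] if d in needed] for m in needed})
            return list(ts.static_order())

        # Task only needs the object's color: shape extraction and classification are skipped
        print(plan(["extract_color"]))        # -> ['segment_region', 'extract_color']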

    Testing a dynamic field account of interactions between spatial attention and spatial working memory

    Studies examining the relationship between spatial attention and spatial working memory (SWM) have shown that discrimination responses are faster for targets appearing at locations that are being maintained in SWM, and that location memory is impaired when attention is withdrawn during the delay. These observations support the proposal that sustained attention is required for successful retention in SWM: if attention is withdrawn, memory representations are likely to fail, increasing errors. In the present study, this proposal is reexamined in light of a neural process model of SWM. On the basis of the model’s functioning, we propose an alternative explanation for the observed decline in SWM performance when a secondary task is performed during retention: SWM representations drift systematically toward the location of targets appearing during the delay. To test this explanation, participants completed a color-discrimination task during the delay interval of a spatial recall task. In the critical shifting-attention condition, the color stimulus could appear either toward or away from the memorized location relative to a midline reference axis. We hypothesized that if shifting attention during the delay leads to the failure of SWM representations, there should be an increase in the variance of recall errors but no change in directional error, regardless of the direction of the shift. Conversely, if shifting attention induces drift of SWM representations, as predicted by the model, there should be systematic changes in the pattern of spatial recall errors depending on the direction of the shift. Results were consistent with the latter possibility: recall errors were biased toward the location of discrimination targets appearing during the delay.
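
    The drift prediction can be reproduced with a generic one-dimensional dynamic neural field of the Amari type; the sketch below is illustrative only, with parameters, inputs, and the Euler discretization chosen for brevity rather than taken from the published model. A self-sustained activity peak encodes the cued location, and a weak input presented to one side during the delay pulls the peak, so the recalled position is biased toward the delay stimulus.

        import numpy as np

        N, dt, tau, h = 101, 1.0, 10.0, -2.0
        x = np.arange(N)
        d = x[:, None] - x[None, :]
        W = np.exp(-d**2 / (2 * 4.0**2)) - 0.2          # local excitation, global inhibition

        def f(u):                                       # output nonlinearity
            return 1.0 / (1.0 + np.exp(-4.0 * u))

        def gauss(center, amp, width=3.0):              # localised external input
            return amp * np.exp(-(x - center)**2 / (2 * width**2))

        def step(u, inp):                               # Euler step of the field equation
            return u + dt / tau * (-u + h + W @ f(u) + inp)

        u = np.full(N, h)
        for _ in range(100):                            # encode the cued location (x = 50)
            u = step(u, gauss(50, 6.0))
        for _ in range(300):                            # delay: weak input from the shifted stimulus (x = 60)
            u = step(u, gauss(60, 1.0))

        recalled = np.sum(x * f(u)) / np.sum(f(u))      # read out the remembered location
        print("recalled position:", round(float(recalled), 1), "(cue at 50, delay stimulus at 60)")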

    Influence of Low-Level Stimulus Features, Task Dependent Factors, and Spatial Biases on Overt Visual Attention

    Visual attention is thought to be driven by the interplay between low-level visual features and task-dependent information content of local image regions, as well as by spatial viewing biases. Though dependent on experimental paradigms and model assumptions, this idea has given rise to varying claims that either bottom-up or top-down mechanisms dominate visual attention. To contribute toward a resolution of this discussion, here we quantify the influence of these factors and their relative importance in a set of classification tasks. Our stimuli consist of individual image patches (bubbles). For each bubble we derive three measures: a measure of salience based on low-level stimulus features, a measure of salience based on the task-dependent information content derived from our subjects' classification responses, and a measure of salience based on spatial viewing biases. Furthermore, we measure the empirical salience of each bubble based on our subjects' measured eye gazes, thus characterizing the overt visual attention each bubble receives. A multivariate linear model relates the three salience measures to overt visual attention. It reveals that all three salience measures contribute significantly. The effect of spatial viewing biases is highest and rather constant in different tasks. The contribution of task-dependent information is a close runner-up. Specifically, in a standardized task of judging facial expressions, it scores highly. The contribution of low-level features is, on average, somewhat lower. However, in a prototypical search task, without an available template, it makes a strong contribution on par with the two other measures. Finally, the contributions of the three factors are only slightly redundant, and the semi-partial correlation coefficients are only slightly lower than the coefficients for full correlations. These data provide evidence that all three measures make significant and independent contributions and that none can be neglected in a model of human overt visual attention.
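
    The kind of analysis described (a multivariate linear model plus a comparison of full and semi-partial correlations) can be sketched on synthetic data as follows. The generating weights, sample size, and variable names are invented; this is not the authors' analysis pipeline.

        import numpy as np

        rng = np.random.default_rng(1)
        n = 500                                          # synthetic "bubbles"
        low_level = rng.standard_normal(n)               # salience from low-level features
        task_info = rng.standard_normal(n)               # salience from task-dependent information
        spatial   = rng.standard_normal(n)               # salience from spatial viewing biases
        empirical = (0.3 * low_level + 0.4 * task_info + 0.5 * spatial
                     + 0.5 * rng.standard_normal(n))     # measured salience, invented weights
        X = np.column_stack([low_level, task_info, spatial])

        # Multivariate linear model relating the three measures to empirical salience
        A = np.column_stack([np.ones(n), X])
        coef = np.linalg.lstsq(A, empirical, rcond=None)[0]
        print("fitted weights (intercept, low-level, task, spatial):", np.round(coef, 2))

        def semipartial(y, X, j):
            """Correlation of y with the part of predictor j not explained by the others."""
            others = np.column_stack([np.ones(len(y)), np.delete(X, j, axis=1)])
            resid = X[:, j] - others @ np.linalg.lstsq(others, X[:, j], rcond=None)[0]
            return np.corrcoef(y, resid)[0, 1]

        full_r = [np.corrcoef(empirical, X[:, j])[0, 1] for j in range(3)]
        semi_r = [semipartial(empirical, X, j) for j in range(3)]
        print("full correlations:        ", np.round(full_r, 2))
        print("semi-partial correlations:", np.round(semi_r, 2))

    With nearly independent predictors, as here, the semi-partial coefficients stay close to the full correlations, which is the pattern of low redundancy reported in the abstract.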

    Representing Where along with What Information in a Model of a Cortical Patch

    Behaving in the real world requires flexibly combining and maintaining information about both continuous and discrete variables. In the visual domain, several lines of evidence show that neurons in some cortical networks can simultaneously represent information about the position and identity of objects, and maintain this combined representation when the object is no longer present. The underlying network mechanism for this combined representation is, however, unknown. In this paper, we approach this issue through a theoretical analysis of recurrent networks. We present a model of a cortical network that can retrieve information about the identity of objects from incomplete transient cues, while simultaneously representing their spatial position. Our results show that two factors are important in making this possible: A) a metric organisation of the recurrent connections, and B) a spatially localised change in the linear gain of neurons. Metric connectivity enables a localised retrieval of information about object identity, while gain modulation ensures localisation in the correct position. Importantly, we find that the amount of information that the network can retrieve and retain about identity is strongly affected by the amount of information it maintains about position. This balance can be controlled by global signals that change the neuronal gain. These results show that anatomical and physiological properties, which have long been known to characterise cortical networks, naturally endow them with the ability to maintain a conjunctive representation of the identity and location of objects.
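
    The two ingredients named above, metric recurrent connectivity and a spatially localised gain change, can be caricatured in a small rate model: Hebbian couplings store object identities, a distance-dependent envelope restricts the recurrent interactions, and a gain bump confines retrieval to one position. All sizes, patterns, and dynamics below are invented for illustration and are far simpler than the analysed model.

        import numpy as np

        rng = np.random.default_rng(2)
        Npos, K, P = 30, 40, 3                          # positions, neurons per position, stored identities
        xi = rng.choice([-1.0, 1.0], size=(P, K))       # object-identity patterns ("what")
        J = xi.T @ xi / K                               # Hebbian identity couplings
        pos = np.arange(Npos)
        d = np.abs(pos[:, None] - pos[None, :])
        d = np.minimum(d, Npos - d)                     # distances on a ring of positions
        G = np.exp(-d**2 / (2 * 3.0**2))                # metric (distance-dependent) connectivity ("where")

        gain = 0.05 + 0.5 * np.exp(-(pos - 15.0)**2 / (2 * 2.0**2))   # localised gain increase at position 15

        # Incomplete, transient cue: a noisy copy of object 0, then purely recurrent dynamics
        S = 0.5 * xi[0] + 0.5 * rng.standard_normal((Npos, K))
        for _ in range(20):
            S = np.tanh(gain[:, None] * (G @ S @ J))

        overlap = S @ xi[0] / K                         # per-position overlap with object 0
        print("overlap at the gain bump (pos 15):", round(float(overlap[15]), 2))
        print("overlap far from the bump (pos 0):", round(float(overlap[0]), 2))

    Running the sketch shows that the identity of the cued object is retrieved and retained only where the gain is elevated, i.e. the network ends up representing what the object is together with where it is.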

    Incremental grouping of image elements in vision

    One important task for the visual system is to group image elements that belong to an object and to segregate them from other objects and the background. Here we present an incremental grouping theory (IGT) that addresses the role of object-based attention in perceptual grouping at a psychological level and, at the same time, outlines the mechanisms for grouping at the neurophysiological level. The IGT proposes that there are two processes for perceptual grouping. The first process is base grouping and relies on neurons that are tuned to feature conjunctions. Base grouping is fast and occurs in parallel across the visual scene, but not all possible feature conjunctions can be coded as base groupings. If there are no neurons tuned to the relevant feature conjunctions, a second process called incremental grouping comes into play. Incremental grouping is a time-consuming and capacity-limited process that requires the gradual spread of enhanced neuronal activity across the representation of an object in the visual cortex. The spread of enhanced neuronal activity corresponds to the labeling of image elements with object-based attention.
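
    The gradual spread of the attentional label can be caricatured as breadth-first label spreading over connected image elements: the label grows outward from a seed, so the number of spreading steps increases with the extent of the object, mirroring the time-consuming character of incremental grouping. This is only an algorithmic toy, not the neurophysiological mechanism; the grid and the seed position are invented.

        from collections import deque

        # Toy image: 1s are image elements, 0 is background; two separate objects
        image = [
            [0, 1, 1, 0, 0, 1],
            [0, 1, 0, 0, 0, 1],
            [0, 1, 1, 1, 0, 1],
            [0, 0, 0, 0, 0, 1],
        ]

        def incremental_group(image, seed):
            """Spread an attentional label from `seed` to all connected image elements."""
            rows, cols = len(image), len(image[0])
            labeled, frontier, steps = {seed}, deque([seed]), 0
            while frontier:
                next_frontier = deque()
                for r, c in frontier:
                    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        nr, nc = r + dr, c + dc
                        if (0 <= nr < rows and 0 <= nc < cols
                                and image[nr][nc] == 1 and (nr, nc) not in labeled):
                            labeled.add((nr, nc))
                            next_frontier.append((nr, nc))
                frontier = next_frontier
                if next_frontier:
                    steps += 1               # each wave of spreading costs time
            return labeled, steps

        group, steps = incremental_group(image, seed=(0, 1))
        print(len(group), "elements labeled in", steps, "spreading steps; the other object stays unlabeled")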

    25th Annual Computational Neuroscience Meeting: CNS-2016

    Abstracts of the 25th Annual Computational Neuroscience Meeting: CNS-2016. Seogwipo City, Jeju-do, South Korea, 2–7 July 2016.